Generalizing from Example Clusters

Authors

  • Pan Hu
  • Celine Vens
  • Bart Verstrynge
  • Hendrik Blockeel
Abstract

We consider the following problem: Given a set of data and one or more examples of clusters, find a clustering of the whole data set that is consistent with the given clusters. This is essentially a semi-supervised clustering problem, but it differs from previously studied semi-supervised clustering settings in significant ways. Earlier work has shown that none of the existing methods for semi-supervised clustering handle this problem well. We identify two reasons for this, which are related to the default metric learning methods not working well in this situation, and to overfitting behavior. We investigate the latter in more detail and propose a new method that explicitly guards against overfitting. Experimental results confirm that the new method generalizes much better. Several other problems identified here remain open.
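To make the problem statement concrete, here is a minimal Python sketch of one possible reading of "consistent with the given clusters". It is not the authors' method. The function name `is_consistent` and the consistency criterion itself are assumptions made for illustration: each non-empty example cluster must fall entirely inside one predicted cluster, and two different example clusters must not end up in the same predicted cluster.

```python
from typing import Hashable, Mapping, Sequence, Set


def is_consistent(
    labels: Mapping[Hashable, int],
    example_clusters: Sequence[Set[Hashable]],
) -> bool:
    """Check a full clustering against example clusters.

    Assumed criterion (not taken from the paper): every example cluster
    maps to exactly one predicted label, and no two example clusters
    share a predicted label.
    """
    used_labels = set()
    for cluster in example_clusters:
        # Collect the predicted labels of all items in this example cluster.
        cluster_labels = {labels[item] for item in cluster}
        if len(cluster_labels) != 1:
            return False  # the example cluster was split (or is empty)
        (label,) = cluster_labels
        if label in used_labels:
            return False  # two example clusters were merged
        used_labels.add(label)
    return True


# Hypothetical data: five items, three predicted clusters.
labels = {"a": 0, "b": 0, "c": 1, "d": 1, "e": 2}
print(is_consistent(labels, [{"a", "b"}, {"c", "d"}]))
print(is_consistent(labels, [{"a", "c"}]))
```

A check like this only tests agreement on the labeled items. It says nothing about how well the clustering generalizes to the rest of the data, which is the overfitting concern the abstract raises.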

Publication date: 2013